Efficient Scan in Log-Structured Memory Data Stores

Systems Group, Department of Computer Science, ETH Zurich

Authors

  • Kevin Bocksrocker
  • Donald Kossmann
  • Markus Pilman
Abstract

Novel storage solutions such as RAMCloud promise never-before-seen latency by storing all of their data in main memory. To store and retrieve objects from memory efficiently, some systems reuse ideas from log-structured file systems and utilise a log-like data structure to hold the data objects in DRAM. Instead of updating data in place, log-structured memory turns every modification into a new data entry that is appended sequentially to the head of the log. An indexing structure (often a hash table) maps each object's key to the log location of its most recently written value and so enables fast random read access to any object by key. Because the log is append-only, it grows over time, so a garbage collection algorithm is needed to reclaim space by stripping inaccessible objects from the log.

While these systems provide exceptional random access performance (retrieving or writing an object with an arbitrary key), they lack an efficient way to scan the whole data set. Scanning all currently stored objects usually requires traversing the hash table and retrieving the data associated with each entry from the log. This approach performs suboptimally because of poor CPU cache locality: under normal load, the data tuples are scattered throughout the log.

The main goal of this master's thesis is to devise an optimized log-structured memory data store that offers a dedicated scan operation. This scan operation should be as efficient as possible while trying to preserve the very high random access performance that the log-structured data layout enables in the first place. The system implements two approaches: a baseline approach modeling current state-of-the-art log-structured memory algorithms, and a novel approach optimized for both scan and random access operations. These implementations build on a full-featured data store that reads and writes key-value pairs in DRAM and includes optimized memory allocation and garbage collection algorithms. Finally, the approaches are evaluated for their performance under different workloads.
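The mechanics described in the abstract can be illustrated with a minimal Python sketch. It is a toy model, not the thesis implementation: the class `LogStore` and all of its method names are hypothetical, the log is a plain list rather than segmented DRAM, and garbage collection simply copies live entries into a fresh log. The method `scan_via_index` mirrors the baseline scan the abstract describes (walk the hash table, follow each pointer into the log). The method `scan_via_log` is only one conceivable sequential alternative and should not be read as the novel approach of the thesis, which the abstract does not detail.

```python
from typing import Dict, Iterator, List, Optional, Tuple


class LogStore:
    """Toy append-only log with a hash index (all names are hypothetical)."""

    def __init__(self) -> None:
        self.log: List[Tuple[str, str]] = []  # append-only (key, value) entries
        self.index: Dict[str, int] = {}       # key -> position of newest entry

    def put(self, key: str, value: str) -> None:
        # Never update in place: append at the head and repoint the index.
        self.log.append((key, value))
        self.index[key] = len(self.log) - 1

    def get(self, key: str) -> Optional[str]:
        # Random read: one index lookup, then one access into the log.
        pos = self.index.get(key)
        return None if pos is None else self.log[pos][1]

    def scan_via_index(self) -> Iterator[Tuple[str, str]]:
        # Baseline scan: traverse the hash table and fetch each entry
        # from the log. Positions are visited in index order, so the
        # accesses into the log jump around.
        for key, pos in self.index.items():
            yield key, self.log[pos][1]

    def scan_via_log(self) -> Iterator[Tuple[str, str]]:
        # Sequential pass over the log that skips stale entries, i.e.
        # entries the index no longer points to.
        for pos, (key, value) in enumerate(self.log):
            if self.index.get(key) == pos:
                yield key, value

    def collect_garbage(self) -> int:
        # Copy live entries into a fresh log, rebuild the index, and
        # return the number of stale entries that were reclaimed.
        live = [(k, v) for pos, (k, v) in enumerate(self.log)
                if self.index.get(k) == pos]
        reclaimed = len(self.log) - len(live)
        self.log = live
        self.index = {k: i for i, (k, _) in enumerate(live)}
        return reclaimed


store = LogStore()
store.put("a", "1")
store.put("b", "2")
store.put("a", "3")  # supersedes the first entry for "a"
```

After the three writes, the log holds three entries but only two are reachable through the index; both scans return the same two live objects, and a garbage collection pass reclaims the stale one. Python itself cannot show the cache-locality effect the abstract refers to, so the sketch only conveys the data layout and the two access patterns.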





Publication date: 2015